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ABSTRACT 


Specifications  for  the  SIMULA  file  classes  which  are  proper 
extensions  of  the  Common  Base  Definition  are  given.  The  following 
assumptions  are  used  in  the  design.  (1)  A  file  is  named  only  by 
a  file  specification  (an  extension  of  a  CMS  fileid) ;  simply  men¬ 
tioning  the  name  of  a  file  causes  it  to  exist.  (2)  A  file  is  a 
sequence  of  characters  divided  into  records  by  <newline> .  This  is 
implemented  using  standard  CMS  files  in  a  way  that  is  transparent 
to  the  program.  (3)  A  file  may  be  written  in  one  of  four  char¬ 
acter  sets:  EBCDIC,  key-paired  APL,  bit-paired  APL  and  the  IBM 
APL  print  train  set.  (4)  Independent  of  the  file  character  set, 
the  file  may  be  read  in  EBCDIC  or  key-paired  APL  with  appropriate 
character  translation  provided  by  the  file  system. 


*  SIMULA  is  a  registered  trademark  of  the  Norwegian  Computing 
Center,  Oslo,  Norway. 
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of  the  file  character  set,  the  file  may  be  read  in  EBCDIC  or  key-paired  APL 
1  with  appropriate  character  translation  provided  by  the  file  system. 


Perhaps  the  easiest  way  to  motivate  the  file  system  design 
described  here  is  to  explain  how  the  author  was  motivated  to 
undertake,  a  substantial  program  development  effort  that  is  quite 
orthaqonal  to  his  research  interests. 

I  brought  several  large  strongly  interactive  DEC-10  SIMULA 
programs  to  Virginia  Tech  for  use  in  VM/CMS.  After  resolving 
problems  related  to  character  set  translation  and  a  few  bugs  in 
the  IBM  SIMULA  compiler,  the  programs  behaved  more  or  less  the 
same  in  VM  and  in  TOPS- 10.  There  was,  however,  one  important 
problem  area  that  remained:  terminal  dialog  and  file  management. 
After  three  months  of  work  on  ad  hoc  program  modifications  to 
solve  problems  and  on  learning  some  of  the  obscure  details  of 
CMS,  I  decided  that  a  general  solution  to  these  problems  was  the 
only  reasonable  way  to  proceed. 

My  first  approach  to  this  problem  was  to  design  and  implement 
a  SIMULA  class  DIALOG  which  contains  procedures  to  efficiently 
rewrite  the  terminal  transaction  part  of  my  programs.  The  design 
of  all  of  my  programs  assumed  that  a  file  can  be  opened  at  run 
time  simply  by  mentioning  the  name  of  the  file.  CMS  SIMULA,  as 
distributed,  requires  that  files  be  given  DD  names  using  filedef 
commands  before  program  execution  begins.  It  became  very  obvious 
that  it  was  completly  unreasonable  to  redesign  my  programs  to 
reference  files  in  this  way.  Before  beginning  the  verification 
of  programs,  one  simply  doesn't  know  all  the  file  names  that  will 
be  used.  Therefore,  this  first  version  of  DIALOG  had  some  rudi¬ 
mentary  procedures  for  naming  files  at  run  time  using  only  CMS 
fileids.  This  version  of  DIALOG  is  described  in  TM  79-3. 

It  very  quickly  became  obvious  that  this  version  of  DIALOG  was 
inadequate  with  respect  to  dynamic  file  naming.  There  were  sim¬ 
ply  too  many  tilings  that  could  go  wrong  resulting  in  a  termina¬ 
tion  of  execution  and  substantial  loss  of  work.  R.  E.  Porter 
then  made  major  changes  in  DIALOG  to  provide  adequate  security 
against  errors  in  fileids  and  other  input/output  errors.  This 
new  version  proved  quite  useful  and  solved  many  problems.  There 
were  still  a  few  details  of  CMS  file  format  that  intruded  occa¬ 
sionally  but  they  could  be  solved  with  some  special  code  and/or 
the  copy  command. 

At  this  point,  one  important  problem  appeared  to  remain:  It 
was  not  possible  to  write  terminal  output  without  a  trailing 
<cr><lf>.  In  addition,  two  calls  to  Outimage  (which  caused 
redundant  blank  lines  in  the  terminal  transcript)  were  required 
to  have  prompts  precede  the  read  for  a  response.  To  solve  these 
problems,  R.  D.  Johnson  wrote  two  assembly  procedures  to  bypass 
the  SIMULA  system  output  code  and  these  procedures  were  imbedded 
in  class  Outfile.  This  quite  adequately  solved  the  problem  but 
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the  whole  set  of  programs  were  now  quite  baroque  (2500  lines  of 
SIMULA,  300  lines  of  assembler) .  This  version  of  DIALOG  is 
described  in  TM  79-3a. 

Both  the  program  verification  system  and  its  associated  APL 
implementation  must  interact  with  both  ASCII  and  APL  character 
set  devices  and  there  are  three  APL  character  set  codes:  key- 
paired  APL,  bit-paired  APL  and  the  IBM  APL  print  train  character 
set.  Character  set  management  and  translation  is  an  appropriate 
part  of  the  input/output  system  and  extensions  to  DIALOG  to  pro¬ 
vide  this  capability  were  designed  and  experimental  implementa¬ 
tions  were  studied.  It  became  quite  obvious  that  programs  built 
on  top  of  such  a  DIALOG  would  be  inefficient  and  lack  reliability 
because  such  a  DIALOG  would  be  extended  far  beyond  what  was  anti¬ 
cipated  in  the  original  design. 

Therefore,  a  comprehensive  design  of  a  file  system  that  is  a 
proper  extension  of  the  SIMULA  Common  Base  Definition  was  under¬ 
taken.  The  view  of  files  adopted  in  this  design  is  quite  differ¬ 
ent  from  the  standard  CMS  view  and  should  prove  to  be  much  easier 
to  use. 

The  first  assumption  is  that  a  file  is  named  only  by  a  file 
specification.  Moreover  simply  writing  to  a  file  after  giving 
its  file  specification  is  sufficient  to  cause  the  file  to  exist. 
Running  programs  may  dynamically  reference  both  input  and  output 
files  using  only  the  file  specification.  The  concept  of  a  DD 
name  does  not  exist.  If  the  specification  of  a  file  changes 
between  executions  of  a  program,  then  the  file  specification  is 
read  as  data. 

The  second  assumption  is  that  a  file  may  be  associated  with 
any  input  or  output  device  and  that  from  the  vantage  point  of  a 
running  program  there  is  no  difference  between  devices  except  for 
the  text  of  the  file  specification.  The  various  file  formats  of 
CMS  are  of  no  concern  to  the  program. 

The  third  assumption  is  that  a  file  is  a  sequence  of  charac¬ 
ters  divided  into  records  by  <newline> .  One  does  not  distinguish 
between  fixed  and  variable  length  records,  etc.,  etc.  Files  are 
read  or  written  one  record  or  line  at  a  time  without  distinction. 

It  is  assumed  that  the  data  in  a  file  may  be  written  in  one  of 
four  character  sets:  EBCDIC,  key-paired  APL,  bit-paired  APL  and 
APL  print  train  code,  furthermore,  a  program  may  interpret  char¬ 
acters  coming  from  a  >:ile  or  written  to  a  file  as  either  a 
sequence  of  EBCDIC  characters  or  as  a  sequence  of  key-paired  APL 
characters.  It  is  the  responsibility  of  the  file  system  to  pro¬ 
vide  the  appropriate  translation. 

A  design  requirement  is  that  the  files  actually  read  and  writ¬ 
ten  must  be  standard  CMS  files  so  that  programs  written  using 
this  file  system  can  communicate  with  other  programs  by  way  of 
the  CMS  file  system. 
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The  file  specification  used  here  is  an  extension  of  the  CMS 
fileid  to  include  device  names  and  character  set  specifications. 
The  file  system  uniformly  communicates  with  disk  files,  the  ter¬ 
minal  and  the  virtual  reader,  printer  and  punch.  Magnetic  tapes 
are  not  included  because  they  are  not  available  to  VM  users  at 
Virginia  Tech. 


2.  Files 


In  a  running  program,  a  file  is  viewed  as  a  sequence  of  char¬ 
acters  divided  into  records  or  lines  by  <new  line>  characters. 
Files  are  read  or  written  one  record  at  a  time  and  individual 
records  may  be  blocked  into  a  standard  length  by  the  input/output 
procedures  to  simplify  programming.  Except  for  the  file  specifi¬ 
cation,  a  running  program  need  not  distinguish  between  different 
input/output  devices  or  the  character  set  of  the  device.  This 
means  that  the  programs  that  read  from  or  write  to  a  peripheral 
must  present  a  uniform  interface  to  user  programs. 

The  file  system  described  here  recognizes  disk  files  and  four 
peripheral  devices: 

Device  File  Specification 


terminal 
virtual  printer 
virtual  punch 
virtual  reader 


tty:  or  con: 
lpt:  or  prt: 
pun: 
rdr : 


There  are  two  formats  for  file  specifications  for  disk  files  and 
these  two  formats  define  different  actions  on  the  part  of  the 
input/output  system. 

The  more  commonly  used  file  specification  for  a  disk  file  is 
the  usual  CMS  fileid  which  is  of  the  form: 

<fn>[  <ft>(  <fm>]] 


The  blanks  in  the  file  specification  may  be  replaced  by  periods 
(.).  If  <ft>  and/or  <fm>  is  omitted,  default  values  are  pro¬ 
vided.  These  default  values  are  different  for  different  kinds  of 
files. 

For  Infiles,  if  <ft>  is  omitted  the  <ft>  is  set  to  DATA. 
After  the  <fn>  and  <ft>  are  completed,  the  <fm>  is  provided  as 
follows.  All  of  the  accessable  disks  of  the  active  job  are 
searched  for  a  file  with  the  given  <fn>  <ft>.  The  first  file 
that  matches  this  partial  specification  in  the  standard  search 
order  and  which  is  available  for  reading  is  selected. 
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For  Outfiles,  if  <ft>  is  omitted  the  string  LOG  is  provided  as 
<ft>.  After  the  <fn>  and  <ft>  is  specified,  the  file  mode  is 
provided  as  follows.  Each  disk  that  is  available  for  writing  is 
searched  in  the  standard  search  order.  The  following  is  a  sketch 
of  the  algorithm: 

IF  "there  is  file  <fn>  <ft>  on  the  disk" 

THEN  "replace  the  existing  file" 

ELSE  IF  "there  is  space  on  the  disk" 

THEN  "create  a  new  file 

<fn>  <ft>  on  the  disk" 

ELSE  "go  on  to  the  next  disk” 

Defaults  for  printfiles  are  completed  in  the  same  way  as  Out¬ 
files  except  that  the  default  <ft>  is  the  string  LISTING. 

A  single  period  is  acceptable  in  place  of  the  blanks  that  usu¬ 
ally  separate  the  components  of  a  CMS  fileid.  In  addition,  the 
string  <fn>..<fm>  specifies  the  <fn>  and  <fm>  while  selecting  the 
default  <ft>. 

The  second  form  of  the  file  specification  is  of  the  form 
f il:<string> 

where  <string>  is  any  EBCDIC  character  string.  This  file  speci¬ 
fication  indicates  that  the  actual  file  specification  is  to  be 
read  from  the  terminal  when  a  file  object  is  created;  see  Section 
4,  below. 

A  complete  file  specification  includes  a  character  set  speci¬ 
fication  which  is  described  in  Section  3,  below.  The  syntax  of  a 
file  specification  is: 

tty:  |  con:  |  lpt:  (  prt:  |  pun:  (  rdr:  | 
fil:<string>  | 

<fn>[{{  | . } <  f t>  [  {  | . }<fm>] | . ,<fm>} ] [<character  set  spec> ] 


3.  Character  Sets 


This  file  system  is  designed  for  use  with  APL  and  with  the 
usual  character  set.  It  is  assumed  that  each  peripheral  device 
may  have  any  one  of  four  different  character  sets  associated  with 
it.  The  four  character  sets  are: 

EBCDIC 

KEY 

BIT 

APL 

The  character  set  EBCDIC  refers  to  the  EBCDIC  equivalent  of 
ASCII  as  defined  by  the  translate  tables  used  with  the  Memorex 
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1380  at  the  Virginia  Tech  Computing  Center.  This  table  is 
attached  as  Appendix  A. 

The  character  set  KEY  refers  to  the  typewriter  pairing  that 
maps  the  APL  character  set  into  ASCII  as  defined  in  the  STAPL 
convention.  After  mapping  the  APL  characters  into  ASCII,  the 
ASCII  characters  are  mapped  into  EBCDIC  as  above.  The  STAPL  Con¬ 
vention  is  attached  as  Appendix  B. 

The  character  set  BIT  refers  to  the  bit  pairing  that  maps  the 
APL  character  set  into  ASCII  as  defined  in  the  STAPL  Convention. 
After  mapping  the  APL  characters  into  ASCII,  the  ASCII  characters 
are  mapped  into  EBCDIC  as  above. 

The  character  set  APL  refers  to  the  character  code  used  to  map 
EBCDIC  codes  into  printable  characters  using  the  APL  print  train 
for  IBM  printers. 

The  actual  contents  of  a  file  may  be  in  a  specific  character 
set  but  the  program  may  wish  to  read  or  write  the  file  using  a 
different  character  set.  For  example,  an  APL  interpreter  might 
well  wish  to  read  an  ASCII  file  as  though  it  were  a  key-paired 
APL  file  but  using  a  translation  that  permits  input  or  output  for 
devices  that  lack  the  APL  character  set.  Thus,  the  specification 
of  the  character  set  associated  with  a  file  must  define  both  the 
character  set  used  in  the  file  and  the  character  set  that  the 
program  is  interpreting.  However,  for  reasons  of  simplicity,  it 
is  desirable  to  have  only  one  APL  character  set  within  a  program. 
The  key-paired  APL  character  set  has  been  selected. 

There  are  two  forms  of  a  character  set  specification,  a  short 
form  and  a  long  form. 

A  short  character  set  specification  is  of  the  form: 

/<file  char  set>.<prog  char  set> 

<file  char  set>  is  the  character  set  in  which  the  file  is  read  or 
written  and  <prog  char  set>  is  the  character  set  in  which  the 
program  wishes  to  interpret  the  file. 

Not  all  combinations  of  character  sets  are  sufficiently  useful 
to  merit  implementation.  For  input  files,  the  following  charac¬ 
ter  set  specifications  are  implemented: 

/EBCDIC. EBCDIC 
/EBCDIC. KEY 
/BIT. KEY 
/APL. KEY 

For  output  files,  the  following  character  set  specifications  are 
implemented: 


/EBCDIC. EBCDIC 
/KEY . KEY 
/BIT. KEY 
/APL .KEY 
/EBCDIC . KEY 
/KEY. EBCDIC 
/BIT. EBCDIC 

If  the  character  set  specification  is  omitted,  the  default  for 
both  the  file  and  program  character  set  is  EBCDIC.  If  the  char¬ 
acter  set  specification  is  /EBCDIC  or  /KEY  then  both  the  file  and 
program  character  set  are  set  to  EBCDIC  or  KEY. 

The  short  form  of  the  character  set  specification  is  difficult 
to  read  and  remember  and  this  may  cause  program  errors.  The  long 
form  of  the  character  set  specification  provides  mnemonics  to 
make  it  easier  to  remember  the  format.  For  input  files,  the  fol¬ 
lowing  eight  long  form  character  set  specifications  are  accept¬ 
able  : 


/FILE  EBCDIC  PROG  EBCDIC 
/FILE  KEY  PROG  KEY 
/FILE  BIT  PROG  KEY 
/FILE  APL  PROG  KEY 


/PROG  EBCDIC  FILE  EBCDIC 
/PROG  KEY  FILE  KEY 
/PROG  KEY  FILE  BIT 
/PROG  KEY  FILE  APL 


For  output  files,  the  following 
set  specifications  are  acceptable: 

/FILE  EBCDIC  PROG  EBCDIC 
/FILE  KEY  PROG  KEY 
/FILE  BIT  PROG  KEY 
/FILE  APL  PROG  KEY 
/FILE  EBCDIC  PROG  KEY 
/FILE  KEY  PROG  EBCDIC 
/FILE  BIT  PROG  EBCDIC 


fourteen  long  form  character 


/PROG  EBCDIC  FILE  EBCDIC 
/PROG  KEY  FILE  KEY 
/PROG  KEY  FILE  BIT 
/PROG  KEY  FILE  APL 
/PROG  KEY  FILE  EBCDIC 
/PROG  EBCDIC  FILE  KEY 
/PROG  EBCDIC  FILE  BIT 


Since  the  short  form  of  the  character  set  specification  is  much 
simpler  when  only  one  character  set  is  used,  there  are  no 
defaults  for  the  long  form. 


The  specification  of  the  character  set  translations  is  com¬ 
pleted  when  each  of  the  eleven  translations  is  specified. 


If  the  file  character  set  and  the  program  character  set  are 
identical,  then  no  translation  is  performed  on  input  or  output. 
This  applies  to  the  specifications  /EBCDIC .EBCDIC  and  /KEY. KEY. 

If  the  character  set  of  an  input  file  is  EBDIC  and  the  program 
character  set  is  KEY  then  the  following  translation  is  performed. 


(1)  Lower  case  EBCDIC  letters  are  mapped  into  APL  let¬ 
ters. 


are  mapped  into 


(2)  Upper  case  EBCDIC  letters 
underlined  APL  letters. 

(3)  Escape  sequences  of  the  form  ,xx  are  mapped  into 
(one  or  three)  key  paired  characters  as  defined  in 
the  APLSF  and  APL. MS  implementations  (Appendix  C) . 

(4)  Those  EBCDIC  graphics  which  are  mapped  into  APL 
characters  by  the  translation  in  Appendix  C  are 
translated  into  key-paired  characters  as  speci¬ 
fied. 

(5)  Those  EBCDIC  graphics  which  are  not  mapped  into 
APL  characters  in  accord  with  (4)  are  mapped  into 
key-paired  characters  in  accord  with  the  inverse 
of  the  mapping  defined  by  the  array  key_paired 
shown  in  Appendix  D. 

If  the  character  set  of  an  input  file  is  BIT  and  the  program 
character  set  is  KEY  then  the  bit-paired  characters  are  mapped 
into  key-paired  characters  as  defined  by  Figures  4  and  5  of 
Appendix  B. 

If  the  character  set  of  an  input  file  is  APL  and  the  program 
character  set  is  KEY  then  the  input  characters  are  mapped  into 
key-paired  APL  so  as  to  preserve  the  appearance  of  the  line. 
Strikeovers  are,  of  course,  mapped  into  three  characters.  A  copy 
of  the  character  set  for  the  APL  train  must  be  secured  from  the 
Computing  Center. 

There  are  five  more  translations  available  for  output  files: 

For  output  files,  if  the  program  character  set  is  KEY  and  the 
file  character  set  is  BIT,  then  character  translation  as  defined 
by  Figures  4  and  5  of  Appendix  B  is  performed. 

For  output  files,  if  the  program  character  set  is  KEY  and  the 
file  character  set  is  EBCDIC,  then  character  translation  that  is 
performed  is  the  inverse  of  the  mapping  defined  for  input  with 
character  set  specification  /FILE  EBCDIC  PROG  KEY. 

For  output  files,  if  the  program  character  set  is  KEY  and  the 
file  character  set  is  APL,  then  the  character  translation  that  is 
performed  preserves  the  appearance  of  the  line  for  the  APL  print 
train.  The  character  codes  for  the  print  train  must  be  secured 
from  the  Computing  Center. 

For  output  files,  if  the  program  character  set  is  EBCDIC  and 
the  file  character  set  is  KEY,  then  the  output  is  translated  into 
key-paired  APL  using  the  mapping  defined  for  input  files  when  the 
file  character  set  is  EBCDIC  and  the  program  character  set  is 
KEY. 


For  output  files,  if  the  program  character  set  is  EBCDIC  and 
the  file  character  set  is  BIT,  then  the  output  is  translated  into 
key-paired  APL  using  the  mapping  defined  for  input  files  when  the 
file  character  set  is  EBCDIC  and  the  program  character  set  is 
KEY.  Next,  the  key-paired  APL  is  translated  into  bit-paired  APL 
using  Figures  4  and  5  of  Appendix  B. 

In  the  above  discussion,  character  set  translation  is  defined 
in  terms  of  several  mappings.  The  actual  program  performs  each 
translation  directly  without  a  sequence  of  translations. 

The  APL  character  set  may  not  be  used  with  the  terminal.  If 
the  specified  file  character  set  for  a  file  connected  to  the  ter¬ 
minal  is  APL,  an  error  message  is  written  to  the  terminal  (a  ? 
message)  and  a  corrected  file  character  set  is  read  or  execution 
is  terminated. 


Common  Specifications 


The  declarations  of  classes  Infile,  Outfile  and  Printfile  used 
in  this  file  system  satisfy  the  definitions  given  in  the  SIMULA 
Common  Base  Definition  with  certain  extensions.  Those  extensions 
which  apply  to  all  three  file  classes  are  described  in  this  sec¬ 
tion  and  extensions  of  the  individual  file  classes  are  described 
in  subsequent  sections. 


The  parameter  of  a  file  class  is  a  file  specification  as 
defined  above.  When  a  class  instance  is  created,  the  file  speci¬ 
fication  is  processed  as  follows: 


(1)  The  character  set  specification  is  separated  from 
the  file  specification  and  translated  into  upper 
case  under  the  assumption  that  it  is  an  .  EBCDIC 
string.  This  text  is  analyzed  and  used  to  set  the 
values  of  the  variables  fil_set  and  prog_set  as 
follows: 


Character  Set  Value 

EBCDIC  0 
KEY  1 
BIT  2 
APL  3 


If  there  is  an  error  in  the  character  set  specifi¬ 
cation,  the  entire  file  specification  together 
with  a  description  of  the  error  is  written  to  the 
terminal  and  the  user  is  asked  to  provide  a  cor¬ 
rect  character  set  specification. 

(2)  The  remainder  of  the  file  specification  is  trans¬ 
lated  into  upper  case  EBCDIC  in  accord  with  the 
program  character  set. 
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(3)  The  missing  components  of  the  fileid  are  provided 
in  accord  with  the  defaults  defined  in  Section  2. 

(4)  The  file  specification  is  checked  for  syntax 
errors.  If  there  are  syntax  errors,  a  message  is 
written  to  the  terminal  describing  the  error  (see 
get_dd_input  and  get_dd_output  in  DIALOG)  and  the 
user  is  given  an  opportunity  to  provide  a  correct 
specification. 

(5)  The  directory  Is  searched  for  the  appropriate 
entry  as  defined  in  Section  2.  If  the  file  cannot 
be  found  a  corrective  message  is  written  to  the 
terminal  and  the  user  is  asked  to  provide  a  cor¬ 
rected  file  specification  (but  not  a  character  set 
specification) .  At  this  point,  the  user  may  enter 
CMS  subset  to  look  for  the  appropriate  file  by 
selecting  the  default  answer  CMS: .  Upon  returning 
from  the  CMS  subset,  the  user  is  again  asked  to 
name  the  file. 

(6)  The  value  of  f_spec  is  set  to  the$CMS  file  speci¬ 
fication  as  translated;  the  character  set  specifi¬ 
cation  is  removed. 

If  the  file  specification  is  of  the  form  FIL: <str ing> ,  then 
the  message: 

%  Please  enter  the  file  specification  for  file  <string>: 

is  printed  on  the  terminal  and  the  user  response  is  interpreted 
as  above. 

All  of  the  usual  procedure  attributes  of  file  objects  will 
first  check  to  see  if  the  file  is  open.  If  the  file  is  closed, 
the  user  will  be  given  an  opportunity  to  open  the  file.  See  the 
code  for  the  attributes  of  class  out_file  in  file  OUTFILE  SIMULA 
for  examples. 

If  the  procedure  Open  is  called  when  the  file  is  already  open, 
a  message  of  the  form 

[%  File  '*<fileid>'’  is  already  open.] 
is  printed  on  the  terminal  and  execution  continues. 

If  the  procedure  Close  is  called  when  the  file  is  already 
closed,  a  message  of  the  form 

(%  File  '<fileid>'  is  already  closed.] 
is  printed  on  the  terminal  and  execution  continues. 
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All  file  objects  will  have  integer  procedure  attributes 
program_set  and  file_set  whose  return  value  is  the  character  set 
number  of  the  program  and  file,  respectively,  associated  with  the 
file  object. 

Each  file  class  has  a  text  procedure  attribute  file_spec  whose 
return  value  is  the  fileid  or  device  specification  of  the  file 
associated  with  the  file  object.  [Most  precisely,  the  file  spe¬ 
cification  without  the  character  set  specification.] 

Each  instance  of  a  file  class  is  to  have  two  integer  stacks 
implemented  as  linked  lists  using  SIMULA  classes  which  will  be 
used  to  store  program  and  file  character  set  numbers. 

When  the  procedure  enter_ascii  is  executed,  the  following 
occurs: 


(1)  The  current  program  character  set  number  is  pushed 
onto  the  program  character  set  stack. 

(2)  If  the  file  is  an  Infile,  Image  is  set  to 

Blanks (Image. Length) .  If  the  file  is  an  Outfile 

or  Printfile,  Outimage  is  executed  if  Image. Strip 
=/=  NOTEXT. 

(3)  The  program  character  set  number  is  set  to  0. 


When  the  procedure  enter__key  is  executed,  the  following 
occurs: 

(1)  The  current  program  character  set  number  is  pushed 
onto  the  program  character  set  stack. 

(2)  If  the  file  is  an  Infile,  Image  is  set  to 

Blanks (Image. Length) .  If  the  file  is  an  Outfile 

or  Printfile,  Outimage  is  executed  if  Image. Strip 
=/=  NOTEXT. 

(3)  The  program  character  set  number  is  set  to  1. 


When  the  procedure  restore  is  executed,  the  following  occurs: 

(1)  If  the  file  is  an  Infile,  Image  is  set  to 
Blanks (Image. Length) .  If  the  file  is  an  Outfile 
or  Printfile,  Outimage  is  executed  if  Image. Strip 
=/=  NOTEXT. 

(2)  The  current  program  character  set  is  set  to  the 
character  set  number  that  is  on  the  top  of  the 
stack  and  the  stack  is  popped. 
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The  procedures  set_ascii,  set_bit,  set_key,  set_apl  and  reset 
are  used  to  change  the  file  character  set. 

The  three  set  procedures  perform  the  following  actions: 

(1)  If  the  file  is  an  Outfile  or  Printfile,  then  if 

Image. Strip  NE  NOTEXT,  Outimage  is  called.  If  the 
file  is  an  Infile,  Image  is  set  to 

Blanks (Image. Length) . 

(2)  If  the  new  character  set  causes  a  change  from 
EBCDIC  to  BIT  or  KEY  or  conversely,  the  appropri¬ 
ate  character  set  changing  character  is  written  to 
the  device  using  Breakoutimage.  [Obviously,  for 
Infiles  a  character  is  written  only  if  the  device 
is  the  terminal.] 

(3)  The  current  character  set  number  is  placed  on  the 
file  character  set  stack  and  the  character  set 
number  is  set  to  the  new  character  set. 

When  the  procedure  reset  is  executed,  the  following  occurs: 

(1)  If  the  file  is  an  Outfile  or  Printfile,  then  if 
Image. Strip  NE  NOTEXT,  Outimage  is  called.  If  the 
file  is  an  Infile,  then  Image  is  set  to 
Blanks (Image. Length) . 

(2)  If  changing  the  character  set  to  the  top  character 
set  on  the  stack  will  cause  a  change  from  EBCDIC 
to  BIT  or  KEY  or  conversely,  the  appropriate  char¬ 
acter  set  change  character  is  written  to  the 
device  using  Breakoutimage.  [Again,  for  Infiles, 
only  the  terminal  character  set  is  changed.] 

(3)  The  character  set  number  at  the  top  of  the  stack 
replaces  the  character  set  number  associated  with 
the  file  and  the  stack  is  popped. 

Partial  implementations  of  the  set  procedures  appear  in  the 
declaration  of  class  out_file;  the  terminal  character  set  changes 
are  included;  not  all  details  match. 

The  file  classes  satisfy  the  class  structure  of  file  objects 
as  described  in  the  Common  Base  Definition  and  Simula  BEGIN . 
That  is,  a  class  File  is  declared: 

CLASS  File  (f_spec) ;  TEXT  f_spec; 

BEGIN 
•  •  • 

END  of  File; 
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All  attributes  that  are  shared  by  all  file  objects  are  declared 
in  class  File.  The  specific  file  classes  are  declared  with  the 
following  headings: 

File  CLASS  Infile; 

File  CLASS  Outfile; 

Outfile  CLASS  Printfile; 

A  Printfile  class  video_file  is  declared  with  an  additional 
boolean  attribute  blackout.  If  blackout  is  true,  a  video_file  is 
just  like  a  Printfile.  If  blackout  is  false,  all  lines  written 
to  the  file  are  also  written  to  the  terminal  with  carriage  con¬ 
trol;  see  Section  7.  The  heading  of  the  declaration  is: 

Printfile  CLASS  video_file 

It  is  usually  the  case  that  two  file  objects  (ttyin  and 
tyyout)  are  associated  with  the  terminal  and  in  some  circum¬ 
stances  there  may  be  additional  file  objects  associated  with  the 
terminal.  This  state  of  affairs  leads  to  potential  confusion 
concerning  the  actual  character  set  of  the  terminal  at  any  spe¬ 
cific  moment. 

Most  APL  terminals  can  change  the  printed  character  set  under 
program  control  and  the  specifications  for  the  file  objects 
require  changing  the  terminal  font.  However,  when  two  or  more 
file  objects  refer  to  the  terminal  they  may  have  a  different  view 
of  the  terminal  type  font  and  this  may  produce  peculiar  looking 
output  on  the  terminal  or  unexpected  interpretations  of  the  input 
lines. 

It  is  the  responsibility  of  the  file  system  to  keep  track  of 
the  type  font  that  is  in  effect  for  the  terminal.  If  the  termi¬ 
nal  type  font  is  ASCII  and  a  file  object  reads  from  or  writes  to 
the  terminal  in  character  set  KEY  or  BIT  then  it  is  the  responsi¬ 
bility  of  the  file  system  to  change  the  terminal  type  font  before 
the  read  or  write  is  performed. 

For  devices  other  than  the  terminal,  it  is  the  responsibility 
of  the  program  to  control  the  type  font  of  the  device.  [It  is 
assumed  that  there  will  only  be  one  file  object  associated  with 
other  devices  but  this  is  not  a  requirement.] 

More  generally,  execution  may  not  be  terminated  for  any  error. 
If  an  error  occurs,  the  appropriate  message  is  written  to  the 
terminal  and  the  user  is  asked  for  a  corrective  response  or  to 
select  termination  of  execution. 


5.  Class  Infile 

Except  when  the  CMS  file  specification  is  TTY:,  class  Infile 
will  deal  with  end-of-file  as  in  the  system  defined  version. 


That  is,  Endfile  becomes  true  and  Image  is  set  to  /* .  Note 
carefully  the  specifications  for  Endfile;  they  are  not  what  one 
immediately  expects. 

If  the  file  specification  is  TTY;,  an  empty  line  will  be 
treated  as  an  empty  line  and  Image  will  be  set  to 
Blanks (Image. Length) .  If  the  first  character  of  an  input  line  is 
control-Z,  this  line  will  be  treated  as  an  end-of-file. 

At  each  call  to  Inimage,  a  line  is  read  from  the  input  device 
and  translated  into  the  appropriate  character  set.  In  addition, 
tabs  are  replaced  by  blanks  under  the  assumption  that  tabs  are 
set  every  eight  spaces.  If  the  result  of  expanding  tabs  in  an 
input  record  is  longer  than  Image .Length ,  the  remaining  text  is 
retained  and  used  to  satisfy  the  next  call  to  Inimage. 

When  the  procedure  Open  is  called,  if  the  length  of  the  param¬ 
eter  is  less  than  the  LRECL  of  the  file,  then  the  user  may  elect 
to  extend  Image  to  the  appropriate  length  by  answering  a  terminal 
prompt  or  elect  to  terminate  execution.  If  Image. Length  is 
greater  than  LRECL,  input  lines  are  padded  with  the  appropriate 
number  of  blanks. 

Instances  of  class  Infile  must  be  able  to  read  any  fixed  or 
variable  length  record  file  in  any  format.  The  running  program 
need  not  know  anything  about  the  file! 

APL  character  set  (BIT  or  KEY)  is  subject  to  three  typographi¬ 
cal  conventions; 

(1)  When  entering  a  line  of  input,  characters  may  be 
entered  in  any  order  with  arbitrary  numbers  of 
backspaces  and  spaces.  The  input  line  is  inter¬ 
preted  just  as  it  appears  on  the  terminal. 

(2)  After  a  line  of  input  is  read,  it  is  canonicalized 
to  remove  redundant  spaces  and  so  that  strikeovers 
are  of  the  form  <chl><bs><ch2>  and  such  that 
Rank(<chl>)  <=  Rank(<ch2>). 

(3)  After  a  line  of  input  from  the  terminal  only  is 
canonicalized,  it  is  examined  for  the  character 
<lf>  (ASCII  10,  EBCDIC  37).  If  this  character  is 
found,  the  line  is  processed  as  follows; 

(i)  All  characters  from  the  <lf>  to  the 

end  of  the  line  (including  < If > )  are 

deleted. 

(ii)  The  remaining  characters  in  the  line 
are  sent  to  the  terminal  using  Breakout- 
image. 
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(iii)  An  additional  line  of  input  is  read 
from  the  terminal. 

(iv)  A  new  string  that  consists  of  the 
remainder  of  the  first  line  and  the  sec¬ 
ond  input  line  are  concatenated  and  pro¬ 
cessing  continues  at  step  (3) 

All  Infiles  perform  input  canonicalization  as  described  above 
for  file  character  sets  KEY  and  BIT.  The  appropriate  transforma¬ 
tion  for  Infiles  with  file  character  set  APL  is  also  performed. 


6.  Class  Outf ile 

All  files  written  by  class  Outfile  are  variable  le??th  record 
files  with  the  shortest  possible  record  length. 

Each  instance  of  Outfile  is  to  have  an  internal  variable  which 
is  true  if  the  output  is  to  be  written  with  tabs  replacing 
strings  of  blanks  and  false  otherwise.  This  variable  can  be 
modified  by  means  of  the  procedures  tabs_on  and  tabs_off.  When  a 
call  to  one  of  these  procedures  is  executed,  the  value  of  this 
variable  is  changed.  By  default,  tabs  are  inserted  into  disk 
files  and  PUN:  files  and  tabs  are  not  inserted  into  TTY:  and  LPT: 
files;  the  appropriate  initialization  is  performed  when  the  class 
instance  is  created. 

The  procedure  attribute  Outtext  is  to  be  extended  so  that  long 
text  objects  can  be  written  over  several  lines  as  in  class 
out_f ile  in  file  OUTFILE  SIMULA. 

The  procedure  attribute  Outimage  is  to  be  extended  so  that 
character  set  translation  is  performed  in  accord  with  the  program 
and  file  character  set.  The  text  object  Image. Strip  after  trans¬ 
lation  and  tab  processing  is  to  be  written  to  the  output  device. 

The  procedure  Br eakoutimage  is  an  extension  of  Outfile.  Char¬ 
acter  set  translation  in  accord  with  the  file  and  program  charac¬ 
ter  sets  is  performed.  If  the  output  device  is  TTY:,  then  the 
text  Image. Sub(l, Image. Pos)  is  written  to  the  terminal  without  a 
trailing  <crxlf>  [carriage  return  —  line  feed]  and  Image  is  set 
to  Blanks (Image. Length) .  If  the  output  device  is  not  TTY:,  then 
the  text  Image. Sub (1, Image. Pos)  is  to  be  added  to  an  internal 
buffer.  Successive  calls  to  Br eakoutimage  for  such  files  simply 
extend  the  buffer.  When  Outimage  is  next  called,  the  buffer  fol¬ 
lowed  by  the  text  to  be  written  in  response  to  the  call  to  Out¬ 
image  is  written  to  the  device. 

Canonicalization  of  output  in  character  sets  KEY  and  BIT  is 
not  provided  but  in  character  set  APL  the  output  files  are  to  be 
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suitable  for  printing  with  the  APL  print  train. 


7.  Class  Pr intf ile 

Class  Printfile  has  all  of  the  attributes  of  an  Outfile 
because  it  is  declared 

Outfile  CLASS  Printfile; 

Therefore,  all  of  the  statements  about  Outfiles  apply  to  Print- 
files. 

Unless  the  output  device  is  TTY:,  the  files  written  by  Print- 
file  are  to  be  in  a  format  suitable  for  printing  with  Fortran 
carriage  control.  Output  carriage  control  may  be  written  by  the 
user  program. 

Except  for  output  device  TTY:,  Printfiles  control  pagination 
by  appending  the  character  1  to  the  beginning  of  lines  that  are 
to  be  printed  on  the  top  of  a  page.  For  output  device  TTY:,  the 
module  CLEAR  is  executed  before  writing  a  line  that  is  to  begin 
at  the  top  of  the  next  page. 


.  Class  Stream 

Infile  class  stream  is  an  extension  of  class  Infile  with  two 
extensions  of  the  procedure  Inimage. 

When  Inimage  is  executed  the  first  character  of  the  input  line 
is  examined  and  if  the  character  is  an  at  sign  (@) ,  the  remainder 
of  the  line  is  interpreted  as  a  file  specification  including 
character  set  specification.  If  the  character  set  specification 
is  omitted,  the  character  set  specification  of  the  current  file 
is  used. 

This  file  specification  is  used  to  create  another  stream  and 
input  is  taken  from  this  stream  until  an  end-of-file  is  encoun¬ 
tered.  After  the  end-of-file,  the  next  input  line  from  the  cur¬ 
rent  stream  is  processed. 

Each  call  to  Inimage  does  cause  a  line  of  input  to  be  trans¬ 
mitted  to  Image. 

If  the  first  character  of  a  line  of  input  is  not  an  at  sign 
(@) ,  then  the  value  of  Image  is  computed  as  follows. 

(1)  The  first  line  is  read  from  the  file  and  trailing 
blanks  are  removed.  If  the  result  is  a  non-empty 
string,  this  string  is  placed  in  Image.  If  the 
result  is  an  empty  string.  Image  is  set  to 

Blanks (buf .Length)  where  the  open  call  was 
Open(buf) ;  steps  2  and  3  below  are  skipped. 
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(2)  If  the  last  character  of  Image. Strip  is  the  minus 
sign  (-)  in  the  appropriate  character  set,  then 
this  character  and  the  remaining  blanks  in  Image 
are  deleted.  Another  line  of  input  is  read  from 
the  file  and  added  to  the  end  of  Image. 

(3)  Step  2  is  repeated  until  a  line  whose  last  non¬ 
blank  character  is  different  from  the  minus  sign 
is  read.  This  line  (with  trailing  blanks  removed) 
is  appended  to  Image. 

This  definition  of  the  action  of  Inimage  implies  that 
Image. Length  is  different  for  different  calls  to  Inimage. 

In  order  to  satisfy  operating  system  restrictions,  it  will  be 
necessary  to  impose  some  limit  on  the  number  of  files  that  can  be 
open  at  any  one  time.  This  limit  is  to  be  set  as  high  as  possi¬ 
ble. 


9.  General  Specifications 

All  advisory  messages  written  to  the  terminal  or  file  are  to 
be  enclosed  in  square  brackets.  Advisory  message  provide  infor¬ 
mation  but  do  not  require  a  response.  Messages  that  require  a 
response  are  not  enclosed  in  square  brackets. 

The  first  character  of  a  message  text  will  be  either  percent 
(%)  or  question  mark  (?) .  The  character  percent  indicates  that 
there  is  a  possibility  of  an  error  and  the  character  question 
mark  indicates  that  a  fatal  error  has  occurred  and  execution  must 
be  terminated  if  it  is  not  corrected.  If  execution  is  terminated 
because  of  an  uncorrected  error,  the  message 

[?  Fatal  error,  execution  terminated.] 
is  printed  on  the  terminal. 

Terminal  prompts  which  include  messages  are  to  use  the  query 
procedures  in  class  dialog.  If  there  is  no  message,  the  program 
is  to  prompt  using  Breakoutimage  as  follows:  Primary  input 
prompts  use  the  character  asterisk  (*)  ,  secondary  input  prompts 
use  the  character  sharp  (#)  and  third  level  input  prompts  use  the 
character  at  sign  (@) .  CMS  at  monitor  level  prompts  with  period 
(.)  . 


There  are  to  be  two  predeclared  and  opened  file  objects:  ttyin 
and  ttyout  which  are  the  terminal  as  input  (an  Infile)  and  the 
terminal  as  output  (an  Outfile) .  All  terminal  dialog  is  to  use 
these  devices  so  that  character  set  translation  for  messages  can 
be  performed.  A  program  writing  an  error  message  should  change 
the  program  character  set  to  the  appropriate  set  and  then  restore 
the  character  set  to  its  previous  value. 


-16- 


All  identifiers  declared  in  the  file  classes  that  are  not 
attributes  of  the  class  will  also  be  declared  PROTECTED  so  that 
they  are  not  available  from  outside  the  program.  Similarly, 
variables  whose  value  is  of  interest  outside  the  class  are  to  be 
declared  PROTECTED  and  access  to  the  variable  is  to  be  provided 
read-only  by  means  of  a  simple  procedure.  This  reduces  the  risk 
of  a  program  changing  something  in  the  file  classes  in  an  unfor¬ 
tunate  way. 

Most  of  the  file  class  code  is  to  be  written  in  SIMULA. 
Assembly  procedures  are  to  be  used  only  if  one  of  the  following 
conditions  are  satisfied: 

(1)  It  is  not  possible  to  write  the  code  in  SIMULA, 
e.g.,  the  code  to  write  to  a  file. 

(2)  An  assembly  coded  procedure  will  be  substantially 
more  efficient  than  the  corresponding  SIMULA  code. 

In  this  case  a  SIMULA  coded  version  will  also  be 
provided. 

The  motivation  for  this  specification  is  to  make  the  program 
easier  to  maintain. 

Program  text  is  to  be  formatted  with  respect  to  the  case  of 
identifiers  in  accord  with  the  default  options  of  SIMED . 

Each  procedure  or  class  declaration  will  have  an  end  comment 
of  the  form: 

END  of  <name>; 

where  <name>  is  the  name  of  the  procedure  or  class  as  formatted 
by  SIMED.  The  information  contained  in  the  comment  is,  of 
course,  helpful  when  reading  the  program.  Following  this  format 
exactly  makes  it  very  much  easier  to  edit  program  files. 

All  SIMULA  programs  will  follow  a  consistent  indentation  con¬ 
vention.  The  indentation  provided  by  SIMED  or  the  indentation 
used  in  file  OUTFILE  SIMULA  meets  this  requirement  but  other  con¬ 
ventions  are  acceptable. 

Comments  are  to  be  formatted  as  paragraphs  with  the  first 
column  of  comment  text  beginning  in  column  8  (first  ANSI  tab)  and 
the  keyword  comment  beginning  in  column  1.  The  semi-colon  termi¬ 
nating  the  comment  will  appear  on  a  line  alone  in  column  8.  See 
the  comment  in  procedure  conver t_to_apl  in  class  out  file  for  an 
example. 

Brief  explanatory  remarks  may  appear  with  the  comment  symbol 
exclamation  point  (l)  in  the  program  text.  See  procedures 
set_key_paired  and  set_ascii  in  class  out_file  for  examples. 


APPENDIX  A 


ASCII-EBCDIC  TRANSLATION  TABLES 
USED  AT 
VIRGINIA  TECH 


The  following  pages  are  an  extract  from  a  Virginia  Tech 
Center  document  that  describes  the  ASCII-EBCDIC  translation 
tables  that  are  used  with  the  Memorex  1380  Communications 
Processor  on  the  VM/CMS  System. 


In  order  to  get  started  on  defining  a  new  output  translation 
table,  I  have  attached  a  copy  of  a  proposed  table  which  would 
solve  the  problems  with  SCRIPT.  This  particular  table  may  be 
only  a  starting  point  for  deciding  on  a  new  table;  however,  what¬ 
ever  table  we  define  must  translate  at  least  one  EBCDIC  character 
into  each  ASCII  character. 

The  notations  which  I  have  used  in  the  table  are  explained  below. 

The  table  contains  256  entries  in  a  16  by  16  array.  Each  entry 
contains  two  or  three  rows.  The  first  row  of  an  entry  specifies 
an  EBCDIC  character;  the  second  row  specifies  the  ASCII  character 
into  which  I  propose  that  it  be  translated;  and  the  third  row 
specifies  the  ASCII  character  into  which  it  is  currently  trans¬ 
lated,  which  I  obtained  from  a  copy  of  the  current  table  given  to 
me  by  Mike  Elliott.  A  second  row  containing  only  #'s  indicates 
that  I'm  indifferent  to  how  that  character  is  translated  (though 
that  doesn't  necessarily  mean  that  a  change  is  not  in  order). 
The  third  row  is  omitted  if  it  is  the  same  as  the  second,  i.e., 
if  I  see  no  need  to  change  the  translation  of  that  character. 
Each  line  of  each  entry,  except  as  noted,  consists  of  a  hexadeci¬ 
mal  code  and  an  equals  sign,  followed  by  the  character  which  the 
code  represents  (if  it's  a  printing  character)  or  the  name  of  the 
character  (if  it's  a  control  character).  For  the  ASCII  hexadeci¬ 
mal  codes  on  the  second  lines  of  entries,  I  set  the  parity  bit  to 
0;  for  the  ASCII  hexadecimal  codes  on  the  third  lines  of  entries, 
I  specified  the  parity  bit  (correct  or  not)  as  it  is  in  the  cur¬ 
rent  table.  Blanks  after  the  equals  sign  indicate  that  that  par¬ 
ticular  hexadecimal  code  has  no  assigned  meaning.  "sp"  is  used 
to  indicate  a  space  character,  and  "not"  is  used  to  indicate  the 
EBCDIC  "logical  not".  A  vertical  bar  overstruck  by  an  equals 
sign  is  used  to  indicate  a  character  which  cannot  be  printed  on 
the  terminal  on  which  the  table  was  printed,  such  as  the  "hook", 
"fork",  and  "chair"  characters.  A  designation  of  "TN"  at  the  end 
of  a  line  indicates  that  that  particular  hexadecimal  code  prints 
as  that  character  with  the  TN  print  train,  but  that  that  code 
does  not  otherwise  represent  any  character. 

My  references  for  the  character  sets  were  System/370  Reference 
Summary  (the  "yellow  card")  and  System/370  Principles  of  Opera¬ 
tion  ,  Appendix  H;  EBCDIC  Chart. 

The  justification  of  the  proposed  translation  of  the  EBCDIC  cent 
sign  (X ' 4A ' )  to  the  ASCII  caret  ( X ' 5E ' )  should  be  mentioned; 
there  is  no  ASCII  cent  sign,  there  is  no  EBCDIC  caret,  and  each 
appears  as  shift/6  on  its  respective  keyboard  (at  least  on  some)  . 

There  are  a  total  of  IS  changes  in  the  proposed  table. 

Determining  the  impact  of  changing  this  translation  is  difficult. 
I  would  not  anticipate  any  problems  for  those  terminals  or  appli¬ 
cations  which  use  only  printing  characters  and  the  basic  control 
characters.  The  problems,  if  any,  will  most  likely  be  with  those 
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APPENDIX  B 

STAPL  CONVENTION 
FOR 

ASCII/APL  TERMINALS 


(Reprinted  from  APL  Quote  Quad) 


Standards 


THE  APL-ASCII  OVERLAY  STANDARD 


(Editor's  Note:  The  following  description 
of  APL-ASCII  overlays  for  typewriter-  and 
bit-paired  terminals  was  originally 
written  in  1972  by  Lawrence  M.  Breed  while 
he  was  at  Scientific  Time  Sharing  Corpora¬ 
tion.  The  proposed  standard  was  accepted 
around  that  time  by  the  SHAPE  APL  User's 
Group  and  is  reproduced  here  by  request  of 
the  STAPL  Executive  Committee  with  STSC's 
permission. ) 


Intrqtiuc&ion 

APL,  a  programming  language  developed 
by  IBM  and  now  offered  by  several  manufac¬ 
turers,  is  particularly  well  suited  for 
interactive  problem  solving  in  both 
scientific  and  commercial  areas.  The  use 
of  APL  is  growing  rapidly.  More  than  50 
universities  in  North  America  are  running 
APL  systems;  commercial  APL  service  from  a 
number  of  vendors  has  been  doubling  every 
nine  months;  and  IBM  has  estimated  that  70 
percent  of  its  own  internal  time  sharing 
work  is  done  with  APL. 

Until  1972,  virtually  all  terminals 
used  with  APL  systems  were  based  on  the 
IBM  Selectric  mechanism.  Then,  however, 
terminal  manufacturers  began  to  support 
APL  features  on  ASCII  terminals.  Because 
APL  uses  a  large  number  of  graphic  charac¬ 
ters  that  do  not  occur  in  ASCII,  an 
alternate  character  set  was  needed.  Work 
toward  a  standard  for  APL  characters  on 
ASCII  devices  was  first  undertaken  by 
James  L.  Ryan  at  Burroughs  Corporation, 
and  has  since  been  advanced  by  other 
members  of  the  APL  Users  Group.  This 
standard  was  accepted  by  the  APL  Project 
of  the  SHARE  organization  and  by  STAPL;  it 
is  being  used  by  many  terminal  manufactur¬ 
ers,  computer  manufacturers,  and  APL  time 
sharing  services. 


The  AFL  Keyboard 

APL-ASCII  is  designed  to  "overlay" 
the  keyboard  of  a  94-graphic  ASCII  termi¬ 
nal  with  a  different  set  of  graphics, 
without  requiring  any  modification  of 
electronic  or  electromechanical  parts 
except  for  the  character-generating  mecha¬ 
nism  itself  (for  example,  a  dot  matrix 
read-only  memory  or  print  head ) .  Specifi¬ 
cally,  pressing  a  key  on  the  terminal 
causes  exactly  the  same  code  to  be  trans¬ 
mitted,  whether  ASCII  or  APL  graphics  are 
employed.  The  ASCII  codes  0/0  through 
1/15  have  the  same  significance  in  ASCII 
and  in  APL;  similarly,  the  control-shift 
key  positions  are  unchanged. 

The  da  facto  standard  APL  terminal  is 
the  IBM  2741  using  print  element  number 
1167987  (Correspondence  coding)  or  1167988 
(BCD  coding).  Figure  1  shows  the  IBM  2741 
APL  keyboard.  This  is  a  particularly  well 
laid  out  keyboard  for  APL  use.  The 
keyboard  groups  arithmetic,  logical,  and 
relational  symbols,  and  places  the  paren¬ 
theses  and  brackets  near  other  paired 
punctuation  characters.  As  far  as  possi¬ 
ble,  this  keyboard  associates  alphabetic 
symbols  and  function  symbols  of  similar 
shape  or  name;  for  example,  over  the  I,  0, 
and  P  are  "iota",  "circle",  and  "power". 
The  orderly  groupings  and  mnemonic  pair¬ 
ings  of  the  symbols  are  a  significant  aid 
to  the  user  in  finding  and  remembering 
symbol  positions. 

On  any  terminal  that  follows  either 
the  ANSI  typewriter-pairing  or  bit-pairing 
standard,  APL-ASCII  provides  a  keyboard 
arrangement  as  close  as  possible  to  the 
IBM  2741  APL  keyboard.  The  typewriter- 
pairing  standard  is  preferred  for  APL- 
ASCII  terminals.  Figures  2  through  5  show 
the  APL  character  set  as  it  appears  on  the 
two  ANSI  keyboards,  and  in  the  ASCII 
transmission  code.  Figure  6  displays  the 
ASCII  transmission  codes. 

One  side  effect  of  producing  nearly 
identical  APL  keyboards  overlaying  the  two 


ANSI  Way boards  is  that  the  particular 
ASCII  character  overlaid  by  an  APL  char¬ 
acter  nay  depend  on  which  ANSI  Iceyboard 
arrangement  is  used.  This  is  of  no 
concern  to  the  APL  user,  but  must  be 
resolved  by  the  APL  tine  sharing  systen. 
The  first  character  of  an  APL  sign-on  is 
always  a  right  parenthesis  (ASCII  code  2/2 
on  a  typewriter-pairing  terminal;  2/10  on 
a  bit-pairing  terminal).  Once  the  tine 
sharing  system  recognizes  one  of  these  two 
codes,  it  knows  which  code  to  use  for 
every  other  APL  character. 

Some  terminals  deviate  slightly  from 
ANSI  Iceyboard  standards.  If  a  terminal 
does  not  adhere  to  the  standards  for 
positions  of  ASCII  characters,  the  trans¬ 
mission  codes  for  APL  characters  must  not 
be  changed.  The  discrepancy  must  be 
reflected  in  the  APL  Icey  positions,  not 
the  transmission  codes. 


Additional  APL  Characters 

ASCII  contains  codes  for  94  different 
graphics,  or  six  more  graphics  than  are  on 
an  APL  Selectric  typesphere.  Six  addi¬ 
tional  graphics  in  the  APL-ASCU  standard 
are : 

$  Dollar  Sign  0  Diamond 

Left  Tack  -t  Right  Tack 

{  Left  Brace  }  Right  Brace 

Except  for  the  diamond  symbol,  these 
characters  have  significance  in  APL  only 
as  printable  graphic  characters. 


Dae  of  Backspace  and  Overstock 
Characters 

Unlike  other  programming  languages, 
APL  makes  heavy  use  of  overstruck  charac¬ 
ters  formed  by  typing  one  character,  back¬ 
spacing,  and  typing  a  second  character. 

Two  examples  of  APL  overstrikes  are  4> 
which  is  formed  from  o  and  | ,  and  !  which 
is  formed  from  .  and  ' . 

Backspacing  does  act  delete  input 
characters .  Input  correction  is  accom¬ 
plished  by  backspacing  to  position  the 
cursor,  then  depressing  LINEFEED.  Depres¬ 


sing  LINEFEED  has  the  effect  of  deleting 
all  characters  above  and  to  the  right  of 
the  cursor.  Backspacing,  then  forward 
spacing,  does  not  alter  the  meaning  of  any 
previously  entered  characters.  In  APL, 
entering  the  sequence 

A , space , C , backspace , backspace , B , return 

has  the  same  effect  as  entering  the 
sequence 

A, B,C, return 

These  APL  input  editing  rules  are 
used  by  nearly  all  APL  systems,  and 
terminals  that  cannot  physically  backspace 
or  terminals  that  erase  buffered  input 
when  backspace  is  pressed  are  at  a  dis¬ 
tinct  disadvantage  in  the  APL  terminal 
market . 


HlaaaJJLanaflua 

Replacing  ASCII  graphics  by  APL 
graphics  in  the  terminal’s  print  mechanism 
is  the  one  important  modification  to  an 
ASCII  terminal  to  permit  its  use  with  APL. 
Most  APL  terminal  manufacturers  offer  APL- 
engraved  keycaps  as  well,  at  least  as  an 
option;  however,  APL  users  have  been  known 
to  work  successfully  with  Just  a  picture 
of  an  APL  keyboard  propped  up  in  front  of 
them.  One  low  cost  alternative  to  engrav¬ 
ing  a  new  set  of  keycaps  is  to  affix  APL 
decals  to  the  fronts  of  the  keys.  With 
this  approach,  both  ASCII  and  APL  charac¬ 
ters  can  be  shown  on  the  keyboard  without 
cluttering  the  keytops. 

Several  APL-ASCII  terminals  contain 
character  generators  for  both  APL  and 
ASCII,  so  the  terminal  can  be  used  easily 
in  either  environment.  Selecting  the 
desired  code  is  done  either  with  a  switch 
on  the  terminal,  or  by  transmission  of 
Shift  Out  (SO)  to  shift  from  ASCII  to  APL, 
and  Shift  In  (SI)  to  shift  from  APL  to 
ASCII. 


When  the  APL-ASCII  standard  was  first 
proposed  in  1972,  it  came  at  a  very 
appropriate  time.  The  variety  of  termin¬ 
als  that  can  support  APL  has  increased 
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dramatically,  and  nearly  all  API  systems 
now  support  these  terminals.  Thus, 
agreement  on  APL  codes  and  keyboards  is 
important.  Designers  of  all  these  termin¬ 


als  and  systems  have  accepted  APL-ASCII, 
because  it  calls  for  a  minimum  of  re¬ 
education  for  the  time  sharing  user  and  a 
minimum  of  change  to  terminal  hardware. 

i 


Figure  1.  IBM  2741  APL  Keyboard 
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Figure  3-  APL-ASCII  Bit-Pairing  Keyboard 
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Figure  6.  ASCII  Transmission  Codes 
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APPENDIX  D 

SOURCE  LISTING  OF 

CLASS  DIALOG 


DIALOG  SIMULA  dialod  ~  terminal  input  ouera  procedures 
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